TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition
Abstract
Few-shot action recognition aims to recognize novel action classes (query) using just a few samples (support). The majority of current approaches follow the metric learning paradigm, which learns to compare the similarity between videos. Recently, it has been observed that directly measuring this similarity is not ideal, since different action instances may show distinctive temporal distributions, resulting in severe misalignment between query and support videos. In this paper, we tackle this problem from two distinct aspects: action duration misalignment and action evolution misalignment. We address them sequentially through a Two-stage Action Alignment Network (TA2N). The first stage locates the action by learning a temporal affine transform, which warps each video feature to its action duration while dismissing action-irrelevant features (e.g. background). Next, the second stage coordinates the query feature to match the spatial-temporal action evolution of the support by performing temporal rearrangement and spatial offset prediction. Extensive experiments on benchmark datasets show the potential of the proposed method in achieving state-of-the-art performance for few-shot action recognition.
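The temporal affine warp mentioned for the first stage can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `temporal_affine_warp`, the normalized time axis in [-1, 1], and the use of linear interpolation are assumptions made for illustration. The abstract says the transform is learned, whereas here the two parameters `scale` and `shift` are simply passed in by hand.

```python
import numpy as np


def temporal_affine_warp(features, scale, shift, out_len=None):
    """Resample per-frame features along time with a 1-D affine map.

    Hypothetical helper, not taken from the paper.
    features: array of shape (T, D), one D-dimensional feature per frame.
    scale, shift: affine parameters on a time axis normalized to [-1, 1];
        in TA2N these would be predicted by a network, here they are inputs.
    """
    features = np.asarray(features, dtype=float)
    num_frames = features.shape[0]
    if out_len is None:
        out_len = num_frames

    # Output time grid in normalized coordinates, mapped to source time.
    grid = np.linspace(-1.0, 1.0, out_len)
    src = scale * grid + shift

    # Convert normalized source time to fractional frame positions.
    pos = (src + 1.0) / 2.0 * (num_frames - 1)
    pos = np.clip(pos, 0.0, num_frames - 1)

    # Linear interpolation between the two neighbouring frames.
    lo = np.floor(pos).astype(int)
    hi = np.minimum(lo + 1, num_frames - 1)
    w = (pos - lo)[:, None]
    return (1.0 - w) * features[lo] + w * features[hi]


# Toy usage: 8 frames whose single feature equals the frame index.
feats = np.arange(8, dtype=float).reshape(8, 1)
# scale=0.5 keeps only the middle half of the clip, stretched to 8 steps.
cropped = temporal_affine_warp(feats, scale=0.5, shift=0.0)
```

With `scale=1.0` and `shift=0.0` the warp is the identity. A scale below 1 zooms into a sub-segment of the clip, which is one simple way to discard leading and trailing background frames.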
Similar resources
A Generative Approach to Zero-Shot and Few-Shot Action Recognition
We present a generative framework for zero-shot action recognition where some of the possible action classes do not occur in the training data. Our approach is based on modeling each action class using a probability distribution whose parameters are functions of the attribute vector representing that action class. In particular, we assume that the distribution parameters for any action class in...
k-Shot Learning for Action Recognition
In the problem of k-shot learning, a model must learn to reliably classify an example having seen only k previous instances of examples of the same class. With recent success in using memory in neural networks to perform k-shot learning, we propose a technique that uses Memory-Augmented Neural Networks to perform k-shot learning for action recognition in videos. We believe the use of memory will ...
Alternative Semantic Representations for Zero-Shot Human Action Recognition
A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information is accessible on the Web with little cost...
One-Shot Learning for Real-Time Action Recognition
The goal of the paper is to develop a one-shot real-time learning and recognition system for 3D actions. We use RGBD images, combine motion and appearance cues, and map them into a new overcomplete space. The proposed method relies on descriptors based on 3D Histogram of Flow (3DHOF) and on Global Histogram of Oriented Gradient (GHOG); adaptive sparse coding (SC) is further applied to capture h...
One Shot Similarity Metric Learning for Action Recognition
The One-Shot-Similarity (OSS) is a framework for classifier-based similarity functions. It is based on the use of background samples and was shown to excel in tasks ranging from face recognition to document analysis. However, we found that its performance depends on the ability to effectively learn the underlying classifiers, which in turn depends on the underlying metric. In this work we presen...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2022
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v36i2.20029